Linguistics and Information Science: a Postscript

Authors

  • Karen Sparck Jones
  • Martin Kay
Abstract

The object of Linguistics and Information Science (Sparck Jones and Kay, 1973) was to show how far the supposedly natural connection between linguistics and information science existed in practice. We surveyed linguistic theory and computational linguistics to identify approaches potentially applicable to information science, and to information, i.e., document, retrieval in particular; and we investigated the linguistic operations of automatic document retrieval to establish their linguistic sophistication and the extent to which linguistic theories were being, or could be, applied. We also looked for evidence of feedback from automatic information retrieval to linguistics. Our general conclusion was that there was very little actual connection between linguistics and information retrieval. Linguists were preoccupied by concerns rather remote from any practical activity like information retrieval, for example the properties of linguistic theories, and had failed to provide tools of potential utility to retrieval workers. At the same time, in both practice and research in information retrieval, needs which might be met by linguistic theory were not properly specified. In general, the linguistic procedures of automatic information retrieval were found to be very simple, and it was not obvious how useful refined linguistic tools would be, either as aids to automation, or as devices for improving retrieval performance.


Similar resources

Thomas Kuhn’s Structure of Scientific Revolutions (referred to as Structure henceforth) was published in 1962, with a second, enlarged edition (with his Postscript) appearing in 1970

Thomas Kuhn’s ideas, particularly of paradigm, are used with some frequency in information science. The usages of paradigm and the problematic nature of Kuhn’s thought are explored. Alternatives to Kuhn are suggested as a way out of the confusion his thought leads to. Résumé : Les idées de Thomas Kuhn, plus particulièrement le paradigme, sont utilisées assez fréquemment en science de l’informat...


The Linguist's Guide to Statistics Don't Panic

writes: The $64,000 question in computational linguistics these days is: What should I read to learn about statistical natural language processing? I have been asked this question over and over, and each time I have given basically the same reply: there is no text that addresses this topic directly, and the best one can do is find a good probability-theory textbook and a good information-theory...


Comparative Study of Nominalization in Applied Linguistics and Biology Books

This study explored nominalized expression types in an applied linguistics book and a biology book as 2 distinct disciplines. The books were carefully read, the nominalized expression types were identified, the frequencies of the nominalization types were counted, and eventually chi-square was administered. Results revealed no significant difference in using nominalization. Furthermore, the den...


Semi-Structured File Analysis for Information Integration

This paper describes a PostScript file analyzer for extracting information from Web PostScript documents. Our motivation for studying this problem is the building of an information-integration system. The information extracted from these semi-structured files can be used to model the contents of Web information sources and to define semantic links between items of information. Extracted informat...


Citation-Based Retrieval for Scholarly Publications

queries against a database of stored, indexed documents. However, many of these search engines have proved ineffective for searching scholarly publications accurately. Researchers have developed autonomous citation indexing agents, such as CiteSeer,4 to search computer-science-related literature online. These agents extract citation information from the literature and store it in a database. Ci...




Publication date: 1976